Symmetric Group Testing and Superimposed Codes 



Amin Emad, Jun Shen and Olgica Milenkovic 
University of Illinois, Urbana-Champaign, IL 



o 

(N 
< 

in 



C/5 



> 

ON 
ON 
(N 

00 

o 



X 



Abstract — We describe a generalization of the group testing 
problem termed symmetric group testing. Unlike in classical 
binary group testing, the roles played by the input symbols zero 
and one are "symmetric" while the outputs are drawn from a 
ternary alphabet. Using an information-theoretic approach, we 
derive sufficient and necessary conditions for the number of tests 
required for noise-free and noisy reconstructions. Furthermore, 
we extend the notion of disjunct (zero-false-drop) and separable 
(uniquely decipherable) codes to the case of symmetric group 
testing. For the new family of codes, we derive bounds on their 
size based on probabilistic methods, and provide construction 
methods based on coding theoretic ideas. 

I. Introduction 

Group testing is a combinatorial scheme developed for the 
purpose of efficient identification of infected individuals in a 
given pool of subjects HI. The main idea behind the approach 
is that if a small number of individuals are infected, one can 
test the population in groups, rather than individually, thereby 
saving in terms of the number of tests conducted. A tested 
group is said to be positive if at least one of its members tests 
positive; otherwise, the tested group is said to be negative. 
Each individual from a population of size N is represented 
by a binary test vector (or signature) of length n, indicating 
in which of the n tests the individual participates. The test 
outcomes are represented by a vector of size n that equals the 
entry-wise Boolean OR function of the signatures of infected 
individuals. The reconstruction task consists of identifying the 
infected individuals based on their test signatures, using the 
smallest signature length n. 

The work in was extended in 13, where the authors 
proposed two coding schemes for use in information re- 
trieval systems and for channel assignments aimed at relieving 
congestion in crowded communications bands. The coding 
schemes are now known as superimposed codes, including 
disjunct/zero-false-drop (D/ZFD) and separable/uniquely de- 
cipherable (S/UD) codes. For compactness, we henceforth 
only use the terms disjunct (D) and separable (S) to describe 
such codes. These two classes of superimposed codes were 
extensively studied - see 0-0 and references therein. 

Superimposed codes are inherently asymmetric with respect 
to the elements of the input alphabet: if all tested subjects 
are negative (i.e., zero), the output is negative. Otherwise, 
for any positive number of infected test subjects, the output 
is positive. Consequently, a zero output carries significantly 
more information than an output equal to one. For many 
applications, including DNA pooling and other genomic and 
biological testing methods with low sensitivity, a test may be 
positive only if a sufficiently large number of subjects test 



positive |4|. In particular, a test outcome may be positive if 
and only if all subjects are infected. 

A probabilistic scheme that makes the group test symmetric 
with respect to the all-positive and all-negative tests was first 
described in 0, 0. To the best of our knowledge, these are 
the only two papers dealing with "symmetric" group tests, 
although only within a probabilistic setting where the inputs 
are assumed to follow a binomial distribution. In addition, the 
method was only studied for two extremal parameter choices, 
using a game-theoretic framework in which the player's strate- 
gies (i.e., reconstruction methods) are fixed and involve some 
form of oracle information. 

A more recent approach to the problem of a nearly- 
symmetric group testing was described in 1101 . Threshold 
group testing assumes that a test is positive (or negative) if 
more (or less) than u (or I) individuals tested are positive, 
where 1 < I < u < N. A test produces an arbitrary 
outcome (zero or one) otherwise. The latter feature makes the 
testing problem highly nontrivial and substantially different 
from symmetric group testing (SGT). 

We are concerned with describing symmetric group testing 
in a combinatorial setting, extending the concept of symmetry 
using information theoretic methods and in developing ana- 
logues of D and S codes, termed D s and S s codes. Our 
results include bounds on the size of the test set of symmetric 
group testing, construction methods for symmetric D and S 
codes, and efficient reconstruction algorithms in the absence 
and presence of errors. Bounds on the size of D s and S s 
codes are based on Lovasz Local Lemma ifTTI and constructive 
coding theoretic arguments. 

The paper is organized as follows. Section [TT] contains a 
short exposition regarding symmetric group testing, while 
information-theoretic bounds on the number of required tests 
are derived in noisy and noise-free scenarios. In Section [TTT1 
a generalization of SGT (termed generalized group testing) 
is introduced by employing a lower and upper threshold, and 
information theoretic bounds on the number of required tests 
are derived. Section [IV] introduces symmetric superimposed 
codes and contains derivations on the bounds on the size of 
D s and S s codes. Finally, Section [V]presents some techniques 
for constructing symmetric superimposed codes. 

II. Symmetric Group Testing 

The use of SGT was originally motivated by applications 
in circuit testing and chemical component analysis 0. As 
an illustrative example, consider the situation where one is 
to test N identically designed circuits using only serial and 
parallel component concatenation. In the serial testing mode, 



one can detect if all circuits are operational. In the parallel 
mode, one can detect if all circuits are non-operational. If 
at least one circuit is operational and one is non-operational, 
neither of the two concatenation schemes will be operational. 
Detecting efficiently which of the circuits are non-operational 
would require a ternary output group testing scheme. 

The reasons for introducing symmetric group testing with 
ternary outputs are twofold. The first motivation is to provide 
symmetry in the information content of the output symbols 
zero and one. Note that in standard group testing, a zero 
output automatically eliminates all tested subjects from further 
consideration, which is not the case with the symbol one. In 
symmetric group testing, the symbols zero and one play a 
symmetric role. The second motivation comes from biological 
applications in which the sensitivity of the measurement 
devices is such that it can only provide a range for the number 
of infected individuals in a pool: for example, the output may 
be "0" if less than I individuals in the test set are infected, 
"1" if more than u > I individuals are infected, and "2" in all 
other cases. 

Throughout the paper we use the word "positive" ("neg- 
ative") to indicate that a tested subject has (does not have) 
a given property. For the asymmetric testing strategy, the 
outcome of a test is negative if all tested subjects are negative, 
and positive otherwise. The outcome of a symmetric test is 
said to be positive (denoted by "1"), inconclusive (denoted by 
"2"), or negative (denoted by "0"), if all the subjects tested 
are positive, at least one subject is positive and another one is 
negative, and all subjects are negative, respectively. 

Let N denote the total number of test subjects, and let T> de- 
note the defective set with cardinality \T>\ = rn. Furthermore, 
let Xz> denote the collection of codewords (or signatures) 
corresponding to T>; note that the length of each codeword 
is equal to n. Also, let y e {0,1, 2} n denote the noise-free 
observation vector (test outcome) equal to the ternary addition 
of the codewords of the defective set, where ternary addition 
is defined as follows. 

Definition 1: For a ternary alphabet {0,1,2}, we define 
ternary addition, +, via the rules + = 0, 1 + 1 = 1, and 

+ 1 = 1 + = + 2 = 2 + 0=1 + 2 = 2 + 1 = 2 + 2 = 2. 
Clearly, ternary addition is commutative and associative. 

Note that in general the ternary addition operator is more 
informative than its binary counterpart; consequently, one 
expects that the number of required tests in the SGT is smaller 
than the number of required tests in a similar asymmetric 
group testing (AGT) scheme. We hence focus on finding upper 
and lower bounds on the minimum number of tests in a 
SGT scheme that ensures detection of the defective set with 
probability of error asymptotically converging to zero. 

A. Noise-free Symmetric Group Testing 

In the noise-free case, the observation vector is equal to the 
superposition of the signatures of defectives, i.e., 

y = x n + x l2 H h x im Vij eD, (1) 



where "+" denotes ternary addition, and stands for the 
signature of the f h defective subject. 

Let the tests be designed independently, and let p denote the 
probability that a subject is part of a given test, independent 
of all other subjects. It was shown in [12] that for any rn = 
o(N), a sufficient number of tests for asymptotically achieving 
a probability of error equal to zero in the AGT setting is lower 
bounded as 

( N — m\ ( m\ 

i = l,2,-..,m (2) 



log( iV - m ) (* 
n > max v V \ 



■■(1 

where denotes all ordered pairs of sets (T>i,T>2) that 
partition the defective set, such that \T>i \ = i and \T>2 1 = m—i. 
In the above equation, I(X-p 1 : X-p 2 ,y) stands for the mutual 
information between Xj> x and (Xd 2 , y); for a single test, Xj) t 
(where t = 1, 2) is a vector of size 1 x \T> t \, with its k th entry 
equal to 1 if the k th defective subject in T> t is in the test and 
otherwise, while y is the outcome of the test. 

Also, a necessary condition for zero error probability in 
AGT was shown to be of the form lfl2l 

log ( N -™+>) 

n> max ° K ' t = l,2,..-,m. (3) 

By following the same steps in the proof of the above 
two bounds, it can be easily shown that the same sufficient 
and necessary conditions hold for the SGT scheme, except 
for the fact that the mutual information will evaluate to 
a different form given the change in the output alphabet. 
Furthermore, it can be easily shown that for a fixed value of 
m, log 



, {N-m+i\ 



i log TV and log fY") (") ~ ilogN 



as 

N — > oo; consequently, these bounds are asymptotically tight. 

In the following Proposition, we evaluate the expressions 
for mutual information in (O and (01 for SGT. 

Proposition 1: In the noise-free SGT for m > 2, 

h{X Vl -X V2 ,y) (4) 

' (l~p) m - l h((l- P y) +p m -'h(p l ) if 1 < i < m - 1 
h(p m ,(l-p) rn ) ' if i = m ' 

where \T>i\ = i, and where for any < C>7 < 1> MC) = 
-ClogC-(l-C)log(l-C) and/ l (C,7) = -ClogC-7log7- 
(l-C-7)log(l-C-7)- 

Proof: Let Wj (where j — 1,2) denote the number of 
ones in Xd-, or alternatively, the Hamming weight of Xx> . 
From the definition of mutual information, one has 

I S (X Vl ;X V2 ,y) = H(y\X V2 ) - H{y\X v ) = H(y\Xj, 2 ) 

2 m — i 

= -^2^2P(w 2 = k)P(y = j\w 2 = k) log P(y = j\w 2 = k). 

j=0k=0 

If 1 < i < m — 1, then 
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Fig. 1 . The behavior of «s (m) and (m) versus m in the noise-free and 
noisy scenario. 

h {Xt> x ; X V2 , y) 

= -P(w 2 = 0)P(y = 0\w 2 = 0) log P{y = 0\w 2 = 0) 

— P(w 2 = 0)P(y = 2| W2 = 0) log P(y = 2|w 2 = 0) 

— P(w2 = m — i)P(jj — 2\w2 = m — i) \ogP{y~2\w2 — m—i) 

— P(w2 = m — i)P{y = l\w2 = rn— i) logP(y = V\W2 = to— i) 

= (i- P r i / i ((i-p) ! )+ P m - t Mp 1 )- 

Otherwise, if i — to, then 

I s (X Vl ;X V2 ,y) = -(l- P ) m log ((1 - p)" 1 ) - p m log 
- (1 - p m - (1 - log (1 - p m - (1 - p) m ) . 

■ 

Similarly, it can be shown that for AGT, 

I A (X Vl ; X V2 , y) = (1 - p) m -^ ((1 - P y) . (5) 

In order to compare SGT and AGT, fix p and let 

( \ ■ PyX-D^Xv^y) 
a(m,p)= mm : 1 = 1,2, ■■■ ,m. (o) 

Using the above definition, the bounds in (0 and (01 asymp- 
totically simplify to n > a(mj>) anc ^ n — jfmj) ' respectively. 
Also, let a\(m) and as(m) denote the values of a(m,p) for 
the choice of p that minimizes the lower bounds in the AGT 
and SGT expression^ respectively. Fig. 1 shows the behavior 
of these parameters with respect to m. As can be seen, for 
m = 2, as [ 7 Z\ =1.5, but as m grows, this ratio converges to 
one. 

B. Symmetric Group Testing in the Presence of Binary Addi- 
tive Noise 

This type of noise accounts for false alarms in the outcome 
of the tests. In this case, the noise vector, z, is modeled as a 
vector of independent identically distributed (i.i.d.) Bernoulli 
random variables with parameter q; the vector of noisy obser- 
vations, y, equals the ternary sum of the vector of noise-free 
observations and the noise vector, i.e., y = y + z. Note that 
in this model, we assume that both 0's and l's may change 
to the value 2, while 2's remain unaltered in the presence of 

'Note that for a fixed value of m, the value of p that maximizes a(m,p) 
may differ for AGT and SGT. 



noise. This model applies whenever dilution effects may occur, 
since adding one positive (negative) subject may change the 
outcome of a negative (positive) group. 

Proposition 2: In the presence of binary additive noise, for 
any m > 2, 

I s (X Vl ;X V2 ,y) (7) 
'(1 - p) m -' l h({\ - pf{l - q)) + p m - l h(p l q) 

- [(1 -P) m +P m ] h(q) if l<i<m-l 

= ' 

h(p m q,(l-p) m (l-q)) 

- [(1 - p) m + p m ] h(q) if i = m 

Proof: It can be easily shown that if 1 < i < m — 1, 

H(y\Xv 2 ) 

2 m — i 

= -^2 ^2 P{w2=k)P{y=j\w 2 =k)\ogP{y=j\w2 = k) 
j=o k=a 

= (1 - p) m - l h ((1 - p)\l - q)) + p m - l h (p\) , 

where W2 denotes the Hamming weight of Xx> 2 - Similarly, 
if i = m, then H(y\X Va ) = h(p m q, (l~p) rn (l - q)). Also 
one has 

2 m — i 

H(y\X v ) = P ( w = tyP(y=j\w = k) lo g P(y=j\w = k) 

= [(l-p) m +p m }h(q), 

where w denotes the Hamming weight of X-p- Using these 
two expressions in the definition of the mutual information 
completes the proof. ■ 
Similarly, it can be shown that for AGT, 

I A (X Vl ; X V2 , y) =(1 - p) m - l h({l - p)\l - q)) (8) 
-(l-p) m h(q). 

Fig. 1 also shows the behavior of as(m) and «a(to) with 
respect to m, for the dilution noise model with q = 0.75. As 
can be seen, SGT outperforms the AGT scheme: for m = 2, 
the ratio is approximately equal to 4.5, and as m grows 

it decreases. Note that for small values of q in this model, 
SGT does not offer any significant advantage when compared 
to AGT: for q — 0, eqs. (|7]i and (JHJ are identical. 

III. Generalized Group Testing 

In many applications, the experiment cannot be modeled 
by AGT or SGT, and a more general model is required. We 
hence consider the generalized group testing (GGT) problem 
in which the outcome of a test equals if the number of 
defectives in the corresponding pool is less than or equal to 
rji (where r/i > 0), equals 1 if the number of defectives is 
larger than 772 (where 771 < rj 2 < m — 1), and equals 2 if the 
number of defectives is larger than 771 and is less than or equal 
to 7? 2 . Note that when -qi — rj 2 — GGT reduces to AGT, and 
when 771 = and 77 2 = m — 1 GGT reduces to SGT. 

Proposition 3: In GGT, and for any i — 1, 2, • • • , to, where 

to > 2, 




Fig. 2. The behavior of ctQ(m), as(m), and a^(m) versus m in the absence 
of noise. 
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Ic {Xvi ; Xt, 2 , y) — 



min(r^i .n 

E 

fc=0 

min(?72 ,7n- 

E 



p (k) h{p 1 (k,rn),p 2 (k,r]2)) 

n — i) 

Po(k) h{p 2 {k, m )) (9a) 



with 



Po(k) = 
Pi (Mi) 



/(I 



minf-i^i — k) 

E 

111111(2,772 — fe) 



P2(k,7] 2 ) = 1 - 



i=0 



(9b) 



(9c) 



(9d) 



As before, p denotes the probability that a subject is part of a 
given random test. 

Similarly to the case of AGT and SGT, one can define 

aa(m) as 

oec( m ) — max a(m,p,r)i,r) 2 ) (10) 

where a(m,p, 771 , 772) is defined in the same manner as 
a(m,p) in ©. Fig. 2 shows a(m) versus to for AGT, 
SGT, and GGT As can be seen, when to > 3, GGT 
outperforms SGT and AGT. For example, when m = 3, one 



has 
has 



ac(rn) 



Q S (m) 



1.6, and at m = 25, one 



1.4 and 24=| 
sa a °( m ) rj 1.6. The arguments that maximize 

a s (m) a A (m) ° 

a(m,p, 771 , 772) are denoted by p*, 77*, and an d are tabulated 
in Table 1 for different values of to. 



IV. Symmetric Superimposed Codes 

In this section, we introduce disjunct and separable sym- 
metric superimposed codes for the SGT scheme. We postpone 
the discussion of generalized disjunct and separable codes to 
the full version of the paper. 

Definition 2: Given two ternary vectors x and y, and a ternary 
addition operator, we say that y is included in x if x + y = x. 

Definition 3: Let the addition operator be the ternary addition 
described in Def. Q] A code C is a symmetric m-disjunct 
code if for any sets of binary codewords {xi,X2, ••• ,x s } 
and {y 1 ,y 2 ,-- - ,yj, the sum xi + x 2 + ... + x s being 
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Optimal GGT probabilities and thresholds for different 

VALUES OF m. 
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included in y 1 + y 2 + ... + y t implies that {xi,x 2 , • • • , x s } C 

{yi ; y 2; • • • .yJ. for an y s,t<m. 

Definition 4: A symmetric code C is an m-separable code if 
x i + x 2 + • • • + x s = yi + y2 + ■■• + y* implies s = t and 
{xi,x 2 , • • • ,x s } = {y l7 y 2 , • • • ,yj for any s,t < m. 

Henceforth, we refer to m as the strength of code C and 
use N(m) and n(m) to denote the number of codewords 
(the signatures of the subjects) and their length, respectively. 
The rate of a superimposed code of strength to is defined as 
Rm = '"s^") where log stands for the logarithm base two. 

rn n(m) ' G & 

Whenever apparent from the context, we use N and n instead 
of N(m) and n(m). 

A. Bounds on the Size of Symmetric Disjunct Codes 

In this subsection, we derive an upper bound on the size of 
D s (m) codes using probabilistic methods. 

Theorem 1 (Upper Bound): Let to > 2 be a fixed number 
and let n — > 00; if N is asymptotically upper bounded by 



AB s n , where A = 
1, then there exists 



1 fm 



and f?s = 



l/m 



> 



(m+l) 

a symmetric disjunct superimposed 
n, N, 771)— code. The rate of this code is asymptotically upper 



> 0. 



bounded by Rs(m) = 

Proof: Let C be a set of N codewords with length n, and 
let M. be a set of to + 1 codewords chosen from C. There are 
(rri+i) different possibilities for M.. For the i th possible choice 
of A4, define Ei as the event that at least one of the codewords 
in Ai is included in the ternary sum of the other to codewords. 
From this definition, each Ei is mutually independent of all 



except 



( N \ _ (N-(m+l)\ _ 
\m+lJ V m+l I 



other events. Suppose that 
if 



P(Ei) < p' for all i. Using Lovasz's local lemma 0T 

N \ fN -(m + l) 
,TO + 1/ \ m+l 



ep 



< 1, 



(11) 



then P (p| i Ei) > 0, where e is the base of the natural 
logarithm and Ei is the complement of the event Ei. In other 
words, if (fTTT i is satisfied, there exists a choice of N codewords 
that form a symmetric disjunct superimposed (n, N, m)— code, 
C. Note that if no codeword is included in the sum of to other 
codewords, then no codeword is included in the sum of any 
set of other codewords with cardinality smaller than m. 

Lemma 1: For all i one has 

2"(m+l) _ ( m + i) (2 m+1 - 2) 



P{Ei) < p' = 1 - 
and as n —> oo, one has 

p' ~ (m + 1) 



(m + l)!( 



2" 



1- - 



(12) 



(13) 



Substituting p' into ( fTTT ) and using the fact that 



AT 



AT- 



^"(m+i)! ' as ^ ~* 00 ' com pl etes the proof. 



Remark 1: Similarly, it can be proved that for m > 2, 
if iV is asymptotically smaller than ABpJ 1 , where £? A = 

■ „m+l 1 V™ 

2 m+i_ 1 > 1, then there exists an asymmetric disjunct 
superimposed (n, N, to) — code. The rate of this code is asymp- 
totically upper bounded by R A (m) = - logP A > 0. 
Consequently, 

Rs(m) m-log(2 m -l) 



R A (m) 

Rs(rn) 



l-log(2 



m+l 



1 



(14) 



The ratio ^(^) I s approximately equal to 2 when to = 2 and 
tends to 1 as to grows. 

B. Bounds on the Size of Symmetric Separable Codes 

In this subsection, we find an upper bound on the size of 
symmetric separable codes when to = 2. 

Theorem 2 (Upper Bound): Let to = 2 and let n — » oo; if N 
is asymptotically smaller than A'B's™, where A' — (2e)~a pa 

0. 569 and £>'s = (|) 3 ~ 1.387, then there exists a symmetric 
separable superimposed (n, N, 2)— code. 

Proof: No two distinct pairs of codewords of a separable 
code have an identical ternary sum. Let C be a code of N 
codewords with length n and let Ai be a set of four codewords 
of C. There are different choices for M.. For the i th choice 
of M., we define J5f as the event that at least two distinct 
pairs of codewords in A4 have an identical ternary sum. Using 
Lovasz's local lemma, if 

then P (P^ > 0, where p" is an upper bound on P(Ei 

1. e. P(Si) < p", for all i. 

Lemma 2: One has 



< 1, 



(15) 



P 



and as n —> oo, p" 



3 • 6 n - 6 • 4™ + 3 • 2 r ' 



l (2" - l)(2 r - 
'3(i) B . 



2)(2» -3) : 



Substituting p" into (fT5T > and using the fact that ( 4 ) — 
( N ^ i ) ~ f^V 3 , as iV — > oo, completes the proof. ■ 



V. Construction of Symmetric Superimposed 
Codes 

Due to space limitations, we only consider symmetric 
separable codes with to = 2 for which a construction based 
on coding theoretic methods is particularly simple. 

Proposition 4: The columns of a parity-check matrix of a 
linear code, with minimum distance of at least five, form a 
symmetric UD-2 code. 

Proof: Let Ci,C2, 03,04 denote four different columns of 
the parity-check matrix H. Furthermore, assume that Ci +C2 = 
C3 + C4 = c. At the positions where c is or 1 , all codewords 
Cx,C2,C3,C4 have a or a 1, respectively. If at some position 
i, c equals 2, then necessarily |cj*\c2*^| = {c^c^l = 
{0, 1} . If we denote the binary sum by 0, the former claim is 
equivalent to Ci©C2 = C3©C4. This contradicts the assumption 
that H is a parity check matrix of a linear code of distance at 
least five. Hence, the columns of H must form a UD2 code. 

■ 

From the Gilbert- Varshamov bound, one would conclude 



that 



i(« -3n+8) 



< 2" where n — k is the length of the 
code and n is the number of codewords. For n — k and n 1^^ 

enough, one would have n 3 ps 6 • 2 n ~ fc and n ~ \/6 ^23 \ 
From the sphere packing bound, one would conclude that 



2" < 2 fc 



Ed— 1 fn\ 
1=0 \i) 



i—k 



For n — » 00, this implies that n ^ 



243 

Comparing the above method to the construction in (3, one 
can see that the size of symmetric codes is twice as large as 
the size of asymmetric codes. 
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